CDS

Accession Number TCMCG078C09648
gbkey CDS
Protein Id KAG0464364.1
Location complement(join(23721537..23721929,23728093..23728205,23728411..23728486,23728582..23728767,23734975..23735055,23735162..23735242,23735332..23735397,23735484..23735549,23735990..23736040,23736172..23736264,23736606..23736733,23736841..23736979,23744407..23744504,23744668..23744737,23744823..23744903))
Organism Vanilla planifolia
locus_tag HPP92_020433

Protein

Length 573aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000010.1
Definition hypothetical protein HPP92_020433 [Vanilla planifolia]
Locus_tag HPP92_020433

EGGNOG-MAPPER Annotation

COG_category TU
Description Epsin N-terminal homology (ENTH) domain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K20043        [VIEW IN KEGG]
ko:K20044        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGGCTCGTGGCGGAAGGCGTATGGAGCGCTTAAGGATTCTACAAAGGTTGGACTTGCAAAGGTTAATAGCGAATTCAAGGAATTAGACATTGCAATCGTGAAGGCAACCAACCACGAGGAATGCCCACCAAAGGAGAGGCATGTGAGAAAAATTTTTGCTGCTACATCTGTTGTTAGACCACGGGCCGATGTGGCTTATTGCATATTTGCACTTGCTAAAAGGCTGGCAAAGACACGCAACTGGGTGGTTGCATTGAAAACATTGATAGTGATACATAGGACATTACGAGAAGGCGATCCCACCTTTCGAGAAGAACTTTTGAATTACTCTCAAAGAGGAAACATTCTGCATATATCTAACTTTAAGGATGATTCGAGTCCGCTTGCTTGGGATTGCTCTGCATGGGTTCGAACATATGCTCTTTTCTTAGAAGAAAGATTGGAGTGTTTTAGAGTTCTAAAATATGATATTGAGGCTGAGCGGTTGATGAGAAATGCACAAGGGGCTCCCAAGGGTCATAGTAGAACAAGAGCCTTAGCTTGTGCTGATCTACTAGAGCAAATGCCTGCATTGCAACAACTTTTGTATCGGCTTGTTGGATGCCAGCCTGAAGGAGTTGGATGTGGAAATTATCTGATACAATATGCTTTGGCATTGGTTTTAAAAGAGAGCTTCAAAATTTACTGTGGTATCAACGATGGGATCATCAACCTAGTTGACATGTTTTTTGATATGTCAAAATACGATGCCATAAAAGCCCTTGAGATATATAAAAGAGCTGGCCAACAGGCAGAAAGCCTTTCTAATTTTTATGACTTCTGCAAGCATCTGGATCTTGCAAGAAATTTTCAGTTTCCAACTCTTAGACAGCCACCTCCATCATTTCTAGCAACAATGGAAGAGTATGTTCGTGAAGCGCCACGTTTTGCTTCTACTTCAAGAAAGAACATAGAATATGAAGAGAAGAATCTTTTAACTAGCAAAGAACACGAATTAGAAGTACAACCTAGTATATCAACAGAAAATCCAGAGCCAATAGCAGAGGAAAAAGAAGAGAATGTAGAACCGGTGCAAGCTGAGGAGCCACCACCAGTAAATGAGGTTAAAACAGAACCTCAGAATACAGGAGATCTTTTGGGGTTGGATGTGGTAAATCCTGTTGCTGCAGAAATTGAGCAAAGCAATGCGTTGGCTTTAGCAATCATTCAGCCTGGAGATGAGTCAAAGCCAACAGCATCAACAGATCTACTTGGTGGTCCGGGATGGGAGTTAGCACTTGTGACAACTCCAAGTAATAATACTAGCCAAGTTGTGGAAAGCAAGCTGGGTGGAGGCTTTGATAAACTCTTGCTCGATAGCCTTTATGAGGATGCATCCAGGAGACAACAAATCGCTGGAGCTACCTATACTGGCAACTTAAACGCTAATCCATTTGATGTGAGAGATCCATTTTCCATGTCAAACTATATTGCTGCTCCACCAAACGTTCAAATGGCTCTAATGGCACAACAGCACCAACAACAACACCATCAACAGCAACTATTATACTATCAGCCTCAACAGATGTACTATCAACAGCAGCAACAGATGATGTTACCTTATGGATATCAAACTCAAAATCCCCAACAACAGCTAAGTTTGACAAATCCTTTTGGTGATTCTTCCAGTGTTAGCTATCCACATGGTGCTTCTATGCAAGGGAAGTCTAGTTTGCTCTGA
Protein:  
MGSWRKAYGALKDSTKVGLAKVNSEFKELDIAIVKATNHEECPPKERHVRKIFAATSVVRPRADVAYCIFALAKRLAKTRNWVVALKTLIVIHRTLREGDPTFREELLNYSQRGNILHISNFKDDSSPLAWDCSAWVRTYALFLEERLECFRVLKYDIEAERLMRNAQGAPKGHSRTRALACADLLEQMPALQQLLYRLVGCQPEGVGCGNYLIQYALALVLKESFKIYCGINDGIINLVDMFFDMSKYDAIKALEIYKRAGQQAESLSNFYDFCKHLDLARNFQFPTLRQPPPSFLATMEEYVREAPRFASTSRKNIEYEEKNLLTSKEHELEVQPSISTENPEPIAEEKEENVEPVQAEEPPPVNEVKTEPQNTGDLLGLDVVNPVAAEIEQSNALALAIIQPGDESKPTASTDLLGGPGWELALVTTPSNNTSQVVESKLGGGFDKLLLDSLYEDASRRQQIAGATYTGNLNANPFDVRDPFSMSNYIAAPPNVQMALMAQQHQQQHHQQQLLYYQPQQMYYQQQQQMMLPYGYQTQNPQQQLSLTNPFGDSSSVSYPHGASMQGKSSLL